
[LoRA] Adds example on text2image fine-tuning with LoRA #2031

Merged
merged 16 commits into main on Jan 23, 2023

Conversation

sayakpaul
Member

@sayakpaul sayakpaul commented Jan 18, 2023

Very much copied from @patrickvonplaten's awesome work with #1884

Things seem to be working on both T4 and V100.

My command:

export MODEL_NAME="CompVis/stable-diffusion-v1-4"
export DATASET_NAME="lambdalabs/pokemon-blip-captions"

accelerate launch --mixed_precision="fp16" \
  train_text_to_image_lora.py \
  --pretrained_model_name_or_path=$MODEL_NAME \
  --dataset_name=$DATASET_NAME --caption_column="text" \
  --resolution=512 --random_flip \
  --train_batch_size=1 \
  --num_train_epochs=100 --checkpointing_steps=5000 \
  --learning_rate=1e-04 --lr_scheduler="constant" --lr_warmup_steps=0 \
  --seed=42 \
  --enable_xformers_memory_efficient_attention \
  --validation_prompt="cute Sundar Pichai creature" --report_to="wandb" \
  --output_dir="sd-model-finetuned-lora-t4" \
  --push_to_hub && sudo shutdown now

The final weights will be pushed to https://huggingface.co/sayakpaul/sd-model-finetuned-lora-v100 and an experimentation run is available here: https://wandb.ai/sayakpaul/text2image-fine-tune/runs/3iiol807 (currently running). Once these are done, I will update the appropriate sections in the README.

@sayakpaul sayakpaul self-assigned this Jan 18, 2023
@HuggingFaceDocBuilderDev
HuggingFaceDocBuilderDev commented Jan 18, 2023

The documentation is not available anymore as the PR was closed or merged.

@sayakpaul sayakpaul changed the title example on fine-tuning with LoRA. [LoRA] Adds example on text2image fine-tuning with LoRA Jan 18, 2023
@sayakpaul sayakpaul marked this pull request as ready for review January 18, 2023 18:29
Contributor

@patil-suraj patil-suraj left a comment


Thanks a lot for working on this! Just left some nits. And we should also add the model card creation function in this script. cf #2032

examples/text_to_image/README.md (outdated, resolved)
examples/text_to_image/README.md (outdated, resolved)
examples/text_to_image/README.md (outdated, resolved)
examples/text_to_image/train_text_to_image_lora.py (outdated, resolved)
examples/text_to_image/train_text_to_image_lora.py (outdated, resolved)
examples/text_to_image/train_text_to_image_lora.py (outdated, resolved)
state_dict = lora_layers.state_dict()
lora_layers.load_state_dict(state_dict)

accelerator.register_for_checkpointing(lora_layers)
Contributor


Think we actually don't need it. If we pass lora_layers to accelerator.prepare and it's an nn.Module, it will be automatically checkpointed.

Contributor


It's in accelerator.prepare so we can remove it here
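To make the reviewers' point concrete, here is a heavily simplified, hypothetical mock (this is NOT accelerate's real internals, only an illustration): anything module-like that goes through `prepare` is already tracked for checkpointing, so an extra `register_for_checkpointing` call would just duplicate the registration.

```python
# Simplified illustration (NOT the real accelerate implementation): objects
# that look like nn.Modules and pass through prepare() are tracked for
# checkpointing automatically, so registering them again is redundant.

class FakeModule:
    """Stand-in for a torch.nn.Module holding the LoRA layers."""
    def state_dict(self):
        return {"lora.up.weight": [0.0], "lora.down.weight": [0.0]}

class FakeAccelerator:
    def __init__(self):
        self._checkpointed = []

    def prepare(self, *objects):
        # Anything with a state_dict (module-like) is tracked automatically.
        for obj in objects:
            if hasattr(obj, "state_dict"):
                self._checkpointed.append(obj)
        return objects if len(objects) > 1 else objects[0]

    def register_for_checkpointing(self, obj):
        # An explicit second registration would only add a duplicate entry.
        self._checkpointed.append(obj)

lora_layers = FakeModule()
accelerator = FakeAccelerator()
lora_layers = accelerator.prepare(lora_layers)
print(len(accelerator._checkpointed))  # prints 1: already tracked via prepare
```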

examples/text_to_image/train_text_to_image_lora.py (outdated, resolved)
Contributor

@patrickvonplaten patrickvonplaten left a comment


Super nice! Great that you made it work so quickly. Think only a couple of things are left to do :-)

sayakpaul and others added 6 commits January 19, 2023 23:07
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
@sayakpaul
Member Author

@patil-suraj @patrickvonplaten I think we're good to go :)


**Note**: When using LoRA, we can use a much higher learning rate than in non-LoRA fine-tuning. Here we use *1e-4* instead of the usual *1e-5*. LoRA also makes it possible to run `train_text_to_image_lora.py` on consumer GPUs like the T4 or the V100.
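For readers less familiar with LoRA, here is a pure-Python toy of the low-rank update (the `alpha / r` scaling follows the common LoRA formulation; the dimensions and values are illustrative, not taken from this training script):

```python
# Toy LoRA forward: W_eff = W + (alpha / r) * (B @ A), with only A and B trained.
def matmul(X, Y):
    return [[sum(X[i][t] * Y[t][j] for t in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

d, k, r, alpha = 4, 4, 2, 4.0   # tiny illustrative sizes

W = [[1.0 if i == j else 0.0 for j in range(k)] for i in range(d)]  # frozen base weight
A = [[0.1] * k for _ in range(r)]   # trainable, shape r x k
B = [[0.1] * r for _ in range(d)]   # trainable, shape d x r

BA = matmul(B, A)
scale = alpha / r
W_eff = [[W[i][j] + scale * BA[i][j] for j in range(k)] for i in range(d)]

# Only A and B receive gradients, so far fewer parameters are updated.
# At realistic sizes the gap is large, e.g. a 320x320 projection at rank 4:
print(320 * 320, 4 * 320 + 320 * 4)  # 102400 full params vs 2560 LoRA params
```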

The final LoRA embedding weights have been uploaded to [sayakpaul/sd-model-finetuned-lora-t4](https://huggingface.co/sayakpaul/sd-model-finetuned-lora-t4). **Note**: [The final weights](https://huggingface.co/sayakpaul/sd-model-finetuned-lora-t4/blob/main/pytorch_lora_weights.bin) are only 3 MB in size, which is orders of magnitude smaller than the original model.
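A back-of-the-envelope check on why the checkpoint is so small (the layer count and dimensions below are assumptions chosen for illustration, not the actual Stable Diffusion layer list):

```python
def lora_bytes(d_in, d_out, r, bytes_per_param=4):
    # Each adapted projection stores A (r x d_in) and B (d_out x r) in fp32.
    return (r * d_in + d_out * r) * bytes_per_param

# Illustrative only: pretend 128 adapted projections of size 640x640 at rank 4.
n_layers, d, r = 128, 640, 4
total = n_layers * lora_bytes(d, d, r)
full = n_layers * d * d * 4   # the same projections stored as full fp32 weights
print(total / 2**20, full / 2**20)  # 2.5 MiB of LoRA weights vs 200.0 MiB full
```

So a low-rank adapter over a few hundred projections naturally lands in the single-digit-MB range, consistent with the 3 MB file above.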
Contributor


Very nice!

@patrickvonplaten
Contributor

Thanks a lot for your work here @sayakpaul - let's try to advertise it nicely now :-)

@patrickvonplaten patrickvonplaten merged commit ffb3a26 into main Jan 23, 2023
@patrickvonplaten patrickvonplaten deleted the add/lora-text-to-image branch January 23, 2023 07:31